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1. Background 
-1,1, IPA Project 

The Instructional Process Analysis Project (IPA) is a study, 
undertaken jointly by the Nordic countries in 1967. The 
group of investigators has included Professor Bjerstedt 
(Malmo, "Sweden ) , Professor StukSt (Gothenburg, Sweden), Dr. 
Rasborg (Copenhagen, Denmark), Professor Sandven (Oslo, 
Norway) and Professor Koskenniemi (Helsinki, Finland). Col- 
laboration between the institutions mentioned has dealt with 
methodology and general exchange of information. Each insti- 
tution has been responsible for its own research design, and 
collection of material and analysis ..The present 'report is a part 
of bhe research directe^d by Prof. Koskenniemi, in which in- 
tensive follow-up study of a single class is the approach 
use'd. The first phase of the IPA^ project in Helsinki dealt 
with the attempt to define what variables can and do- oper- 
ate in teaching. If successful, the future application of 
variables describing significant aspects of interaction will 
help to determine in what degree measures of classroom com- 
munication can be used to indicate essential intervening mech- 
anisms when changes in the behaviour of pupils are examined. 

The following is a summary of the author^'s earlier publica- 
tions; 

1. Koskenniemi M. and Komulainen E. and others. 1969. 
Invest igat ion's into the Instructional Process : I . 
Some Methodological Problems. Res. Bull,, Inst, of 
Education, Univ. of Helsinki, No. 26. (Oct. 1969), 

2. Komulainen E. 1970. Investigations into the Instruc- 
tional Process: II. Objectivity of Coding in a Mod- 
ified Flanders Interaction Analysis. Ibid., No 27 
(Dec. 1970). 
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3. Komulainen E. 1971. Investigations into the Instruc- 
tional Process: III. P-techn iq^ Treatment of Obser- 
vational Data. Ibid., No 28 (Jan. 1971). 

4. Komulainen E. 1971. Investigations into the Instruc- 
t ional Process : IV. Teaching as a Stochastic Process. 
Ibid. , No 29 (Feb. 1971 ) . 

To create a synthesis of the earlier reports, suppl ementa ry 

results relating to certain methodological questions are given. 

Then main principles of the analyses performed may be clear 

from the present report,, although I naturally recommend read- 

1 } 

ers to acquaint themselves with the above publications , 
especially with that part 'that describes the methods empoy{3d. 
Studies by Koskenni emi , Komu lainen and Falck 1969 , by Komu- 
lainen 1970c, by Koskenniemi, Komulainen and others 1972, and 
'.by Koskenniemi ^1 972b also explain the research strategy. 

1. 2.. Place of This Investigation within the Study Field of 
the Instructional Process 

The ,present study discusses a methodological problem area of 
very narrow scope within the^ IPA project. The aim is, there- 
fore, to present a theoretical frame of reference by condensed 
reference to relevant literature. It is clea^" that in any 
methodological investigation the background theory of the 
category system with its attendant concepts and constructs 
is implicitly present. It should be stressed, therefore, that 
methodological problems a re not to be seen separately from 
the bro'ad and rapidly growing field of research into and theory 
of instruction. The general literature on this subject is 
extensive, and a number of bibliographies are also available. 
I assume the reader'^s awareness of the main features of this 
research area with regard to the following trends: 

1) Publications 2, 3 and 4 are also availabl.e from the ERIC 
document Reproduction Service, P.O. Drawer 0, Bethesda. 
Maryland 20014, U.S.A., in microfiche and (ready-to read) 
xerox copy form. ERIC document numbers are: ED 057 084, 
ED ^059 276 and ED 059 277 . 



The European study tradition. This may be cons ide red 
to have started with the pedagogical- fact research of 
Petersen. The tradition continued, as modified by 
Winnefeld, and has influenced the empirical study of 
teaching in many ways, especially in the German-speak- 
ing area (Reiniger 1924 and 1929; Mu 1 ler-Petersen 1952, 
Winnefeld 1957; Petersen & Petersen 1965; Rohrs 1968; 
Kessel 1970; Maier & Pfistner 1971 )\ In German Demo- 
cratic Republic particularly,- the part played by -ob- 
servational study ha's- substantially increased in the 
last few years (Roth 1969; Grassel 1970).= 
Study tradition of interaction analysis. The famous 
studies of classroom climate by Anderson and the re- 
search group of Lew in, Lippitt and* White are classics 
in the literature of educational psychology (Anderson 
1939; Lewin et al. 1939). With the studies of Withall 
(1949), and especially those of Flanders, it may be 
said that the school of interaction analysis came to 
birth (Amidon & Hough 1967; cf. also Flanders 1970). 
A knowledge of Flanders^ studies and of interaction 
analysis (FIAC) is vital to the understanding of this 
report, because the choice of a system of classifica- 
tion (FIAC) involves adherence to a theoretical frame 
of reference as its basis. Nor is the contribution of 
Bales (1959) without influence. The interaction process 
analysis of Bales (IPA) and the lAC system of Flanders 
are closely similar applications of the observational 
method. Interaction analysis today has an important 
field of application in the study of teaching and in 
teacher training (Flanders 1970; Ober & Bently & Miller 
1971 ) . 

Teacher efficiency studies. These are numerous, and 
among them it is advisable to be aware of the main 
results of such designs in which the variables for 
teaching are based on observation, and some kind of 



adjusted gain scores are used as a criterion variable 
(Rosenshine 1968; Flanders & Simon 1970; Flanders 1970, 
376-426; Nuthall 1970; Rosenshine 1971). 



4} Methodological questions raised by the observation pro- 
cess (Medley & Mitzel 1963; Weick 1968). 

The abuve classification is, of course, arbitrary and over- 
lapping. In practice there are no such mutually exclusive 
study traditions and schools of thought. Many studies deal 
with material from different fields. Chief emphasis in phB 
present report is on interaction analysis and questions, of 
method of observat ion . 




2j1. beneral Aim 



Problems concerning the construction of variables describing 
the instructional process have become strongly evident in re-- 
cent times . There are several reasons for t his . Some researchers 
regard attempts to elucidate the question of instructional ef- 
ficiency as premature, because the character of the independ- 
ent variable (instruction) is too little known . B. 0. Smith 
(1969) expresses the matter as follows: 

"All these considerations support the conclusion ,that 
teaching is to be studied in its own right if we would 
understand it and thereby gain control over it." 

De Landsheere (1971) presents another viewpoint which supports 

a- methodology based on taxonomic systems: 

"One of the main factors explaining the general failure 
of research trying to relate teacher behaviour to pupil 
achievement is the crudeness of the independent variables 
used." 

Apart from the degree of specificity of the variables, atten- 
tion should be paid to this problem: variables relating to 
instruction are often assembled in such a way that they indi- 
cate unconnected elements in the classroom communication. In- 
struction is a 'sequence of activities advancing in time, a 
process whose connection with the changing behaviour of its 
participants is cumulative. Consequently, the variables should 
be determined on the basis of the connection between the ele- 
ments appearing in the process, that is, on the context. At- 
tempts in this direction have been made. Tne Bellack cycle, 
the B. 0. Smith teaching cycle, Flanders' critical teaching 
behaviour and Nuthall's verbal move sequence may serve as ex- 
amples . 

The general aim of the present study is to examine the factor- 
analytical method of sequence analysis, and to produce vari*- 
ables that measure the appearance of sequential tendencies. 
We shall also examine the process from the starting point of 
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transition probabilities between the elements. These method- 
ological examinations are confined to the FIAC system. | 

2.2. Specific Aims 

The specific aims fall within three main areas: 

1) The reliability problem of measures of classroom behav- 
io.ur (Komulainen 1970b). The reliability of the coding 
forms an inescapable methodological question in obser* 
vational research. In the training and research work 

^ of observers certain standard solutions have been thought 

to be sufficient. In the'case of the FIAC system this 
is Scott's II. Our purpose is to examine the practical 
utility of this type of method. 

2) Within t he , framework of- lessons, sequences of inter- 
action .may occur which characterize only the lessons cf 
a certain type. We shall analyze these sequences by "the 
technique, of inverse factor analysis (Koskenniemi & 
Komulainen 1969). 

3) Within the framework of lessons, sequences of inter- 
action appear whose basic^ form remains the same, but 
whose relative -^^requency varies fxom lesson to lesson.^ 
Factor analysis will be applied tc identify such se- 
quences if passible, and to form quantities indicating 
their tendency to appear (Komularnen 1971a). Sequences 
are also analyzed by examining probabilities of tran- 
sitions from one state to another (Komulainen 1971b) . 
We thus attempt to define more precisely the sequence 
of events arising from successive states within the 
process . 

2.3. The Aims of This Report 

The first part of this report summarizes the contents of 
earlier reports. The aim is to present the main features of 
reliability examination and the factor-analytical approach 
: for the investigation of behavioural sequences. In order to 

f create a more penetrating sythesis, various special questions 

t are examined which have received less attention in earlier 




reports. A further aim is to use the devices offered by 
Markoi/ chains and by stochastic information theory as a means 
of elucidating the degree of dif f erentat ion of behavioural se- 
quences. It is hoped thereby to obtain a more precise and ana- 
lytical picture of the preliminary results based on transi- 
tion probabilities (Komulainen 1971b). 

2>4, Study nateriai 

The material was video-taped in the laboratory class of the 
Inst^itute of Education, Univ. of Helsinki, in the autumn term 
of 1967. It consisted of 25 lessons covering five subjects of 
the regular third grade curriculum. All lessons were recorded 
with th^ same class. Questions of validity are of central im- 
portance In the material of this type. They have been dis- 
cus.sed elswhere (Komulainen 1970a, 72-79). At the time of 
writing, the study is continuing in accordance with both the 
plans . introduced in Chapter S. 

The taxonomic instrument was a modified FIAC system (App. \), 

Earlier reports have discussed only the class instruct i-rn les- 
sona coded by FIAC, which have bsen mentioned above. The pres- 
ent report uses for comparative purposes the material gathered 
by Holopainen (1971, 45-48 and 1972, 4-5) for his study of ' 
group work, and 96 regular class instruction lessons from 
school terms 1967-69, both coded with Bales" IPA. Comparative 
materials are used only in Section 5.5. 
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3, Reliability 

The nominal scale of codings makes it difficult to apply nor- 
mal methods of reliability assessment to measurements obtained 
by interaction analysis. Tha techniques for assessing errors 
of measurement are thus more developed than those for meas- 
uring errors of classification (Reiss 1971, 21). The defini- 
tion of reliability as a ratio of true to total variance is 
not applicable to such a, situation (Valkonen 1971, 60). 

In' Report No 2^ (Komulainen 1970b) the reliability of co.dings 
was assessed as follows: in addition to 'Scott's II coeffi- 
cient based on marginal distributions (Scott 1955), agreement 
coefvicients for categ^ories were calculated by the author's 
method. The normal code/3 sec. procedure was reduced to a 
simple yes/no for each category. It is then possible to cal- 
culate an agreement coefficient for each category. It also 
ensures that the decision of" each coder is based on the same 
time unit. The background for this coefficient is introduced 
in Report No. 2, pagesM3-15. Formulae with symbols from set 
theory are given on pages 16-17. Using 0/1 codes (0 = cate- 
gory not present in the time unit; 1 - category present in 
the time unit), formulae can be simplified as follows: 



Agreement for the category between coders A and B is obtained 
from 



(1) A= ^ 



EA ^ EB_ 



i 

f' 

i 
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Agreement due to chance is calculated from the starting point 
of the statistical independence hypothesis. The multiplica- 
tion rule of the classic probability calculation is then ap- 
plied. The chance agreement can be computed from 



(2) - Pa-Pb 

using relative frequencies or 



(3) 



ZA * IB 



"^able 1 shows the calculation procedureby means of a simple 
example. 
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Table 1. Calculation of the Agreement Indices A and A- 
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In Report No. 
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were 


used 


as 


such , 


because 





coefficients remained relatively low (range .00 - .14; 
Komulainen 1970b, 19). It is possible for agreement coeffi- 
cients to be so consructed as to allow for the fact that 



part of the agreement (A) was caused by chance (A^), The 
corrected A coefficient could be calculated from them in the 
manner of Scott's II. 
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(4) Corrected A - ^ 



A - A. 
c 



1 - A 

r 



In the case of frequent categories (6 and 9} this correction 
would have slightly lowered the coefficients. This is negli- 
gible, however, so that the correction was considered unnec- 
essary . 

Scott's II coefficient calculated from marginal distributions 
almost the only index used to indicate the reliability of FIAC 
II can be calculated quickly and easily: simple nomograms can 
be drawn up for its graphic estimation. This is necessary dur- 
ing the training of coders to ensure a swift feed-back. The 
author's view, based on a comparison, is that minimum profi- 
ciency Scott's II> .85 is a relatively conservative require- 
ment. If broader category systems are used its attainment may 
be difficult. Comparison between these indi ces . shows that the 
use of Scott's II is justified, although it is not based on 
cross-tabulation from common units. The danger of mutually 
compensating errors (or identical marginal distributions de- 
spite a divergence of unit by unit codings) did not appear 
serious in the light of the results obtained by me earlier. 

The reliability of observation will remain a persistent metii- 
odological problem, because the value of results depends cru- 
cially on the accurate use of the rretalang»i£-^^e m. i,he coding 
process. Care should be taken not to align the reliability 
problem of category systems with the normal measurement on 
the quant itative scale, where reliability is defined as the 
ratio of true to observed variance. Functioning with ipsative 
nominal scales involves factors which complicate the classic 
treatment of measurement errors. The error does not then en- 
large the variance, as with quantitative variables, but pro- 
duces a futile or "worse" variance (cf. Valkonen 1971, 61). 
A high degree of agreement in classification is easy to attain 
by using few categories. The measurement of concepts by means 
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of such a classification remains crude and deficient in validity. 
As the validity problem is often very complex, reliability has 
been the means of demonstrating the utility of the whole system 
of classification. The classic theory of measurement errors 
regards reliability as a necessary but insufficient precon- 
dition of validity. In connection with categorization, the 
number of classes and subdivisions in the category system can 
also be regarded as a prequisite of validity. A set of four 
to six categories is the probable minimum in this respect. 

In connection with measurements based on classification, lack 
of reliability does not mean that the majority of classifica- 
tions occur by chance. The coder's individual and unique man- 
ner of interpreting the situation and using the metalanguage 
of the coding system is an additional factor causing disagree- 
ment: it may remain constant intraindi vidual ly yet vary from 
one coder to another. 

/ \ / / 

reliable coder^'s chance 

unique 

systemat i c . ran dom 

There is reason to take seriously the warning of Weick, who 
points out that the observer forms an incalculable factor in 
any observational system (1968, 360). Emphasis on the unique 
quality of the coder's contribution leads to uncertainty over 
the extent to which the same concepts correspond to each other 
in different studies (Rosenshine 1970, 120). 



- 13 - 



4, Exploring the Sequences through Factor Analybic Techniques 
4,1. Summary of Nethodology 

The trensition matrix was examined by factor analysis in order 
to ascertain wito+i transition clusters (sequences) are to be 
found. Both inverse and normal factor analysis were used. 
Because the units are only one in number (the same system with 
measurements in time t^ ... t^^] , they are called 0 and P 
t^echniques (Table 2). 

Table 2. Factor Analytic Techniques (Cattell 1952, 109) 



Data 
plane 


Technique 


Factor 
regarded 
as con- 
stant 


Correlated 
and f actorized 


Nature of 
factors 




R 




Variables 


Trait factors 


V - P 




S 








0 




Units 


Type factors 




P 




Variables 


State factors 


V - P 




u 








0 




Situations 


Situation fac- 
tors 




T 




Situations 


Situation fac- 
tors 


P - S 




V 








S 




Units 


Type factors 



[ V = variable 

I P=unit(personetc) 

I S=situation(t^...) 

I 0 technique . The same system (group, organism, individual etc.) 
I ^ IS examined by making repeated measurements of the same vari- 

I ables in different situations which are then correlated. In 

% 

i 
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the present study, the interaction matrices from the identical 
class of students but produced by different situations were 
correlated with each other. The sequences were constructed 
as follows : , 

1) The two lessons with the strongest positive factor loading 
were sought; also the two with the strongest negative load- 
'ing. A strong bipolarity was expected in view of the the 

ipsative nature of the data matrix. 

2) Variables were identified which were parallel (as standard 
deviations from the arithmetic mean] in the positive load 
lessons and, correspondingly, in the negative load les- 
sons, but which separated the lessons of different sign 
from each other (Figure This was best performed by 
drawing the standardized variableprof i les of all four les- 
sons on the same graphic illustration and making compar- 
isons withit. 

3] From the variables discovered (e.g. 9-1, 1-3, 3-3, 3-6) 
sequences were constructed to illustrate separately the 
process of the lesson type for each pole of the factor 
(Koskenniemi i Komukainen 1969, 18-28]. 

Figure 1. Interpretation Procedure for the 0 Technique 

Identification of principal lessons of factor on basis 
of factor loadings positive and negative 





comparison of 
variable pro files; 
si mi la ri ties 



^ f i les ; simil- 
arities 



comparison of 
variable pro- 




erences 



comparison of 
variable pro- 
f i les ; dif f - 




essential 
variables for 
positive load 
lessons 



essential 
variables for 
negat ive load 
lessons 




into flow 
diagram in 
each group 



const ruction 
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The 0 technique interpretation could presumably be improved 
and freed from subjective errors by calculating factor scores 
for the variables and using them in the interpretation. An- 
other method would be to correlate factor loadings of the les- 
sons with other varia&les. The emerging correlation matrix 
would identify the essential variables for each type (Baumann 
1971. 1741. It is probable that the instability of inter- 
pretation within the context of inverse factor analysis could 
be substantially reduced by applying these different methods. 

P technique. In the same system, repeated measurements are 
made from a number of variables in different situations. The 
variables are intercorrelated. and factor analysis with 
varimax rotation is performad (Komulainen 1971a]. The inter- 
action sequences were elucidated as follows: 

11 High-positive and high-negative load variables for the 

factors were sought and arranged in their own groups. 
21 Chains formed hy transitions were constructed for 

each factor pole and their features presented as flow 

diagrams (Komulainen ig71a. 16-251. 

Figure 2. Interpretation Procedure for the P technique 
Variables | | t t t t t 

m tttt 



Factors 



Interpretation y ^ , ^ 
as sequences 



o 
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4.2e Some methodological problems 



For each lesson, a normal interaction matrix was formed, 
whose cells were used as variables. Because the interaction 
accumulates heavily in certain areas of the matrix, 50 out 
of the total 169 possible cells ware selected for treatment 
(Komuloinen 1971a, 12-14). 

Figure 3. Variables selected from interaction matrices 





1 


2 3 


4 


5 


6 


7 


8 


9 


10 


11 12 13 


1 




5 


7 


13 


16 


25 




35 






2 










17 












3 




6 ■ 






18 












4 






8 










36 




45 


5 








14 








37 






6 






9 


15 


19 


26 


31 


36 


42 


46 


7 






10 




20 


27 


32 


39 




47 


8 










21 


28 


33 






48 


9 


1 


3 


11 




22 


29 




40 




49 


10 


2 


4 






23 








43 




1 1 




















1 


12 






12 




24 


30 


34 


41 


44 




13 




















50 j 


The 


selected variables 


contain some 


85 


% of 


transitions of 


the 


matrix. Certain 


prob lems 


are 


ra| 


£:ed 


when 


cells of the 



millage matrix are used as variables in factor analysis. 



N/p ratio. In the present study the number of observations 
is very small compared with the number of variables. The 
invariance of the factor structure from one sample to an- 
other thus becomes a problem. Dawley & Maxwell (1971, 63) 
note that it is a safe rule to require N - p>50. Pawlik 
(1971, 276) for his part writes that, on the eviuence of 
methodological experience, factor loadings vary far more 
than correlation coefficients from one sample to another, 
so that a factor analysis with N <60 should not be performed. 
Cattell mentions a 4-to-1 rule of thumb as the minimum 
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(1952, 345), although he notes that the most difficult problem 
of the P technique in particular is to ensure an adequate 
amount of repeated measurements. When using the P technique 
Cattell himself has been obliged to compromise with regard 
to this minimum (e.g. Cattell et al. 1947). With extremely 
small N/p ratios we are brought back to the fact that the 
rank of the data matrix is less than or equal to the smaller 
side of the data matrix. The rank is equal to the number of 
factors that can be delineated. Pummel (1970, 2201 expresses 
the view that when the interest is only in describing data 
variability, factor analysis will yield such a description 
regardless of the fact that variables exceed cases in number. 
In the present study, however, observations used in the P 
technique may be regarded as aggregates obtained by adding. 

Figure 4. Total Matrix Calculated as Sum of Parts 
summation 




each of the matrices to 
be added together, pro- 
duced from 5 minutes' 
observati on 



total matrix 



The total millage matrix may be described as a sum matrix 
formed from the parts of a lesson. Selection of the total 
matrix as a unit of observation is based on the following 
arguments, among others; 
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1) Distributions of variables from total matrices were 
nearer to a symmetrical, unimodal form than dis- 
tributions from partial matrices, where positive 
skewness - especially in variables on the diagonal 
(steady state cellsl ^ might grow diturbingly strong, 

2) Partial matrices would remain too scanty in the ob- 
servation material. If 500-600 acts of a lesson are 
divided into, say, six parts and a 13 x 13 matrix is for- 
me dfromeach, it is clear that such a matrix is highly 
unstable (expected mean frequency 0.6 transition/cell). 

3) If intercorre la t ions between variables calculated 
over total matrices are high among certain contingen- 
cies, it may be presumed that the transitions in 
question have occurred during the lesson so close to 
each other in time that they may be regarded as be- 
longing to the same behavioural sequence. The sim- 
ilarity of factor structures calculated for matrices 
of different length (5 min^/ '^O min., etc.) will give 
empirical support to the hypothesis. 

Correlations calculated for the aggregate, i.e. total matrix, 
are in general higher than those calculated from parts of 
the aggregate. In such a case the increase in the intercorre- 
lations is due to the disappearance of a part of the random 
variation in the values of smaller units around the regres- 
sion line (Valkonen 1971, 511. 

Ipsativism of measurements and form of distributions . Doubly 
standardized data have been label led ipsative in contrast to 
normative (Cattell 1966a, 115-1161. Dubois calls such matrices 
closed systems, and points out that such system may cause 
diffuculties of interpretation in factor analysis (19571. 
Those matrices may also he considered ipsative whose row 
sums add to a constant from one observation to another, Horst 
mentions that the' influence of such centering on the rank of 
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the matrix is of considerable importance (Horst 1965, 294). 
He also developes methods of deriving the factors for one 
scaling of the data matrix from the factors of a different 
scaling (Horst 1965, 295-314). Baumann notes that the form 
of the primary set of data is decisive for the emerging 
factor structure. From the centred matri ces emerge fewer but 
strongly bipolar factors (Baumann 1971, 178). 

Figure 5. Emergence of Bipolar Factors from Centred Matrices 



The variables of the present study can be regarded as ipsative. 
The 50 cells selected from the interaction matrix contain on 
the average 85 % of all transitions.^ No variable therefore 
can be presented as a perfect linear combination of otTler 
variables, but within certain limits the row sums of the! data 
matrix reach a constant. The ipsative nature of the variables, 
combined with the fact that for some observations the entire 
row sum may consist of an exceptionally high value of one 
variable, raises a problem that is difficult to control in 
calculating the intercorrelation matrix. Let us assume a 
situation where, within a certain range, a moderate positive 
correlation prevails between two variables, but when one or 
other obtain a high value the ipsativism load inevitably 
produces a low value in the other member of such an XY pair. 
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The situation might appear as in Figure 6. The effect of 
such exceptional XY pairs is so powerful through squaring 
that a few divergent pairs may change a moderate positive 
correlation into , a negative one. An effort can be made to 
control the situation by such methods as the following: 

1) Distributional transformations, by which the tail 
section of each distribution* is cut short (e.g. log X). 

2) Treatment of the values in tail sections as missing 
data, ' in which case values of the variables greater 
than the cut-off point are ignored when calculating 
corre lat ions . 

31 Rejection of observations which contain values forming 
thfc extreme tail section of the variables. 
Experiments with fictive examples have led me to propose Method 
No. 2 for further studies. Distributional transformations 
produce quite unexpected results in the cases where a diver- 
gent variable value is far from the remaining., relatively 
homogeneous group of scores. Met.hod No. 3 reduces the number 
of observations which is often too small already. Method No. 2 
is likely to give the best appromaximation of the "true" 
correlation between variables. Side effects harmful to the 
emergent correlation matrix will probably be quite insignific- 
ant . 

Figure 6. Influence of Ipsativism as a Scatter Diagram of 
Corre lat ion between two Variab les 



■• :'.N 



X 
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Some results obtained through -transformation analysis . 

■transformation analysis enables us to summarize in what manner 
^splitting the total interaction matrix into pa-^ts (see pp. 
17-18), and using the parts as observation units, will affect 
the emergent factor structure. What is the correspondence of 
factors, and what variables change most from one analysis to 
another? 

Four factorizations were made: 

11 Total" matrix as an observation unit whose results were 
reported earlier (Komulainen 1971a); 

2) 15' matrix .as an observation unit; 

3) 10' matrix as an observation unit; 

4) 5' matrix as an observation unit; 

In each case unities were placed in the diagonal spaces of 
the correlation matrix as h^ estimates. Principal axis solu- 
tions of six factors were rotated to the varimax criterion. 
These solutions were compared with each other by means of 
symmetrical transformation analysis. It may be mentioned as 
a general observation that the expected phenomenon - overage 
growth of correlations moving' toward the total matrix - was 
clearly apparent. This is seen from the average communalities 
of different analyses. 

Table 3. Average Communalities in Different Analyses 

Analysis , 
12 3 4 

Mean .eg ,SB .51 .43 

SD .12 .1.5 .14 .16 y 

Number of 50 50 50 50 

variables 

It is often asked whether the space activated by factors can 
be regarded as having the same structure in different cases, 
or whether the delineation of variables in factor space is 
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the same in different cases. Symmet ri cal ' transformation 
analysis is a technique presented by Mustonen (1966), which 
enables to obtain information on similarity of factor struc- 
tures and abnormal transformation of variables. The advantage 
of symmetrical transformation analysis over other analogous 
techniques tcf. Rummel 1970, 463 et seq.; Ahmavaara 1954; 
Ahmavaara & Nordenstreng 1970) is that the residual co-vari- 
ances of variables are the same whatever the direction of 
scrutiny. Orthogonal rotations preceding symmetrical trans- 
formation analysis do not affect the examination of trans- 
formation of variables. It is generally most advisable to 
make a transformation analysis after the final orthogonal 
rotations. Faccor correspondences (cosines of angles between 
axes) appear then directly from the transformation matrix, 
which in an ideal case is unit matrix I. 

Conf igurational invariance remains relatively high between 
the analyses (Tables 4-9). Factors I and VI are tne most 
unstable in this respect. An examination of transformation 
matrices L^j reveals that these factors have a tendency to 
become mixed with each other. Difficulties in interpretation 
(Komulainen 1971a, 16-25 and 27-30) and this result indicate 
that the number of rotated factors may have been too large 
cr that the situation is not favourable for an orthogonal 
solution . 

We shall next examine with which variables structural devia- 
tions are clearest of all. This is done by examining how the 
corresponding variables of factorized solutions behave when 
placed in pairs in the same space. The cosines of the angle 
between the vectors are used as descriptive quantities for 
structural deviations (Table 10^ columns 6-JJl. In Appendix 
2 the squared distances between vectors are also shown among 
different solutions. This is not a highly revealing measure, 
because a systematic shortening of the vectors was already 
noted when starting from the calculated total matrix solution 
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Table 4. Correspondences of Factor I between Different 
Solutions (teacher indirect -direct influence) 



1 



1 
2 
3 



2 

92 



.96 
.91 



.69 
.66 
.65 



Table 5. Correspondences of Factor II between Different 
Solutions ( reactive pupil activity] 



1 



1 

2 
3 



2 

.65 



3 
.91 
.63 



4 
.85 
.67 
.96 



Table 6. Correspondences of Factor III between Different 
Solutions (vicious circlej 



1 

2 
3 



.97 



. 69 
.96 



. 86 
.95 
.98 



I 

I" 

ft- 

1 
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Table 7. Correspondences of Factor IV between Different 
Solutions (no clear interpretation) 



1 
2 
3 



2 

.99 



.88 
.84 



.80 
.76 
.95 



Table 8. Correspondences of Factor V between Different 
Solutions (question -answer) 



1 

2 
3 



2 

.91 



.89 
.96 



.77 
.85 
. 86 



Table 9. Correspondences of Factor VI between Different 
Solutions (spontaneous pupil activity) 



1 



1 

2 
3 



2 

.92 



3 

.93 
.74 



.59 
.59 
.56 
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Figure 7. Example of Stuctural 



Deviation of Vari ables 




Structural deviations are centred on variables whose frequency 
is already low in the total matrix (most clearly for the 
variable No. 38). Factor structures remain highly invariant 
between solutions . Structural deviations revealed by vari- 
ables may be mostly >explained through the ceiling effect 
exerted by the changing -form of the distributions (cf. Guilford 
1965, 336-338; cf. also Cattell 1952, 322). The skewness of 
distributions and, consequently, the ceiling effect increases 
in strength with the shortening of the period from which the 
matrices for observation units are formed. The differences 
grow almost systematically and are in the total/5' comparison 
at their maximum. Results show, however, that information is 
not substantially contaminated if partial matrices are aggre- 
gated into a total matrix. For purposes of practical obser- 
vation a total matrix may well be used as a starting point 
for interpretation. 
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Table 10. Communal it ies of Solutions and Structural 
Deviations by Variables between Solutions 

Explanations of columns: 



( 1) 
(2) 
(3) 
(4) 
(5) 
(6) 



number of variable 

communality of variable in analysis 1 (total) 



2 
3 
4 



(15- ) 
(10' ) 
(5' ) 



( 7 1 




II 




N 


ff 


ff 


n 


ft 


^ 0 Q 


f fl 1 




II 




If 


ff 


ff 


w 




If 


A 0 A 
1 6 4 


( 9 1 




II 




ff 


ff 


ff 


M 




If 


TOO 

I h 3 


f 1 n 1 




n 




n 


ff 


ff 


n 




H 


^ & 4 


f 1 1 1 




n 




II 


ff 


ff 


w 




Iff 


Q 0 /I 

3 & 4 


f 1 1 


f 7 1 








I ID J 


( 7 > 


I tj J 


I 9 J 


( 10 J 


( A A \ 

(11) 


1 


.90 


. 83 


. 80 


.73 


.95 


.95 


.89 


.95 


.97 


.96 


2 


.71 


.53 


.45 


.32 


.93 


.95 


.83 


.90 


.86 


.73 


3 


. 63 


. 55 


.43 


.24 


.90 


.91 


.85 


.98 


. 78 


.76 


4 


.64 


.30 


. 30 


.26 


.94 


. 78 


.57 


.-92 


.80 


.93 


5 


.85 


.87 


. 79 


. 70 


.97 


.97 


.97 


.98 


.95 


.96 


6 


. 86 


. 80 


. 73 


.61 


.97 


.98 


.96 


.97 


.96 


.97 


7 


.69 


.55 


.60 


.55 


. 84 


.90 


.72 


.93 


.95 


.91 


8 


.52 


.35 


. 36 


.29 


. 86 


.74 


.67 


.95 


. 88 


.88 


9 


.75 


. 56 


.61 


. 54 


.96 


. 92 


.95 


.98 


.98 


.99 


10 


.63 


. 38 


. 34 


.19 


.95 


.92 


.75 


.95 


.75. 


. 86 


11 


.59 


. 62 


.28 


. 33 


.99 


.74 


.47 


.71 


.44 


.92 


12 


. 85 


. 78 


. 63 


.47 


.99 


.99 


.96 


.97 


.93 


.99 


13 


.58 


.35 


.46 


. 34 


. 76 


. 82 


.67 


.87 


.91 


.95 


14 


. 57 


.47 


.45 


.41 


.94 


. 88 


.75 


.90 


.80 


.96 


15 


. 69 


.62 


.61 


.48 


.98 


.96 


.86 


.99 


. 89 


.88 


16 


.49 


.40 


.47 


.37 


.75 


.71 


.69 


.96 


.84 


.87 


17 


. 59 


.46 


.25 


.17 


.98 


. 93 


.93 


.98 


.96 


.91 


18 


. 82 


.79 


. 70 


.63 


.98 


1 .00 


.98 


.98 


.95 


.99 


19 


. 86 


. 76 


. 62 


.58 


.98 


.93 


.85 


.96 


.93 


.93 


20 


. 59 


.41 


. 32 


. 39 


.95 


.93 


.72 


.90 


.74 


.88 


21 


. 64 


. 44 


.45 


. 32 


.90 


.87 


.83 


.97 


.94 


.96 


22 


.35 


.29 


.21 


.09 


. 83 


. 79 


.74 


.87 


.91 


.98 


23 


. 68 


.53 


.45 


. 37 


.95 


.92 


.76 


.99 


. 86 


.67 


24 


. 70 


.70 


.48 


.29 


.75 


. 82 


.74 


.93 


.85 


.95 


25 


. 60 


.48 


. 33 


.25 


.98 


. 86 


.79 


.78 


. 82 


.78 


26 


. 86 


.60 


.46 


.49 


.95 


.82 


.88 


.92 


.91 


.85 


27 


.78 


.65 


. 69 


.59 


.94 
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. 87 


.98 


28 


. 63 
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.91 
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29 
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.95 
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30 
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. 67 
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31 
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.62 


.41 


.95 
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32 


.64 


.35 
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.30 


.96 
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.91 


.90 
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.92 


33 


.50 


. 39 
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. 52 


.95 
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.S9 


.97 


.96 


.99 


34 


.74 


.53 


. 59 


. 60 
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.85 


.85 


.94 


.81 


.84 


35 


.67 


.40 


. 33 


. 14 


.97 


.92 


.76 


. 83 


.78 


.54 



ERJCI 



r 

I 

f. 



- 27 - 



Table 


10. 


(continued) 




f 1 1 
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36 
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.78 
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.64 



(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


n Q 
• U O 


. 9 1 


. 88 


. 88 


. 83 


.98 


n □ 


. 97 


. 88 


. 96 


. 83 


.94 


• 8/ 


.96 


. 30 


. 91 


.28 


.19 


. 9 5 


. 99 


. 99 


. 97 


.98 


.99 


• 99 


. 98 
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.97 
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. 89 


. 56 


. 94 


. 69 


. 74 


.98 


.91 


. 90 


. 89 


. 93 


.93 


.99 


.96 


.90 


. 93 


.87 


.94 


.94 


.86 


.76 


.97 


.87 


.94 


.94 


.88 


. 81 


.97 


. 88 


.95 


.93 


.88 


.91 


. 88 


.98 


.93 


.98 


.92 


.91 


. 95 


.88 


.90 


.87 


.83 


.73 


. 54 


.60 


.85 


.96 


.92 


. 96 


.93 


.98 


.97 



structural deviations observed in the variables, however, 
cause doubts as to whether factor scores can be used as indi- 
ces of the frequencies of sequence occurrence. In the first 
plase, regression coefficients used in the estimation of factor 
scores are susceptible to sampling fluctuation. Factor scores 
as they arise, therefore, are based for each sample on a unique 
set of regression coefficients. Comparisons between samples 
become difficult. ^In the second place, it must be seriously 
doubted wheter it is at all possible to interpret factors as 
sequences as explicitly as the writer has done . (Komulainen 
1971a, 16-25). Although the interaction matrix may reveal 
accumulation into certain cells, chains of greater length are 
hardly to be anticipated. It follows from this that the model 
sequences presented in the material scarcely at all, still 
less are they to be repeatedly indentified. A difficult meth- 
odological problem arises when chains are sufficiently similar 
in form to be regarded as belonging to the same type (cf. Hutt 
& Hutt 1970, 177-178; Cherry 1966, Ch.7). It is evident that 
indices calculated from interpretation areas of the matrix 
have a content which is similar and more concrete but fully 
adequate for practical application. 
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5. Sochastics of Classroom Communication 



5.1. General 



If the notion of hierarchic structure held by Rosenblueth et 
al. (194 3) is slightly modified, it can be easily recognized 
that interaction in the classroom has features which clas- 
sify it as active, purposeful, feedback-oriented, predictive 
and probabilistic in the range of behaviour. 



Figure 8. Range of Behaviour (modified from Rosenblueth et 
al., 1943, 223) 

Behaviour 

■ i> <\ 

Passive Active 

Non- purposeful Purposeful 
(random) ^ <^ 

Non- feedback Feedback 

Nun-predicti ve Predictive 

i> \ 

Deterministic Probabilistic 

(first, sec- 
ond etc. or- 
ders of 
prediction ) 



Because there are intentive, goal-oriented features in the 
classroom interaction which are clearly predictable in a prob- 
abilistic sense, it is quite natural to examine interaction 
in a manner based on the notion of probability. Communication 
theory and Finite Markov chains are the two main areas on 
which the following investigation is based. Infor^mation the- 
ory and Markov chains are not separate sectors. Indices e- 
volved as part of communication theory are specifically used 
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to illustrate the characteristics of each particular chain. 

The stochastic assumption implies that variability is itself 
conceptually included in the model and not merely added as an 
extrinsic error term to an equation whose primary structure 
has already been determined (Jaeckel 1971, 242). The stoch- 
astic assumption has long been linked to the concept of 
instruction (e.g. Winnefeld 1957, 30-44; Grassel 1970, 8-18). 

5.2. Finite Markov Chains 

In an earlier connection I have presented a background: for 
the use of Markov chains (Komulainen 1971b, 1-8). Mere defi- 
nitions of the concepts in use are given here. They follow 
mainly the practice suggested by Kemeny & Snell (1960). 

A finite Markov chain is a stochastic process which moves 
through a finite number of states, and for which the prob- 
ability of entering a certain state depends only on the last 
state occupied (cf. later expanded process). 

An ergotic set of states is a set in which every state can be 
reached from every other state, and which cannot be left once 
it is entered. 

An ergotic state is an element of an ergotic set. 

A cyclic chain is an ergotic chain in which each state can 
only be entered at certain periodic intervals. 

A regular chain is an ergotic chain that is not cyclic. 
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The chain of states arising from the classroom interaction 
in connection with coding is therefore mainly a regular chain. 
It contain no absorbing states (i.e. states which, once en- 
tered, are nsver left) and is not cyclic* 

Formed from a chain is a transition probability matrix, 
which is a square matrix (of order k x k; k = number of 
states), and whose row sums add up to unity. Calculable for 
states from the transition probability matrix are f i rst 
passage times matrix (FPT) and the corresponding vari ance 
[or standard deviation) matrix (SD of FPT). FPT quantities 
express the average number of steps before entering a certain 
state for the first time after the initial position. The 
figures of the SD of FPT, in turn, express variation observed 
in the number of steps around their corresponding means 
(Arosalo 1968). 

The Markov chain theory can be used for the examination of 
dependences extending back more than one step. From the orig- 
inal Markov chain we form a new Markov chain. A state in the 
new chain will be a pair of states in the old chain. Such a 
new chain is called expended process . By examining this process 
we obtain more detailed information, because it is based on 
conditional probabilities that are cirtermined by two preceding 
states in the original chain. The order of approximation in 
the chain under examination has no theoretical upper limit. 
Because the number of new states in the expanded process 
grows as a power function, there are practical limits. 

By combining the original states we can reduce their number 
to a smaller one, while still preserving essential features 
from the original process in a rougher form. With the number 
of states reduced in this way we pass to the examination of 



i 
i 
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higher approximation orders. Such a chain we call lumped 
process > 

In the present study one long chain with 14 660 successive 
steps was formed from the whole material, and on its basis 
the transition probability matrix was calculated. With the 
original number of states (k=13) an expanded process with 
169 states was formed, whose transition probabilities have 
been presented and examined earlier (Komulainen 1971b, 9-15). 
From the original system of 13 states a lumped process with 
5 states was formed, from which in turn expanded processes 
were formed which were bas.ed on ever higher orders of ap- 
proximation. Results concerning stochastics of classroom 
communication are based on this material. 

Table 11. Summarized Treatment of Results 

Results 

FPT & Indices from 
SD of inf ormat ion 
FPT theory 



Original 13 Py^^^ 

Expanded 169 p (y) no 



Original (k=13 ) Number Condi t- 

of ional 
states p 



Lumped (k-5 ) 



Lumped 


5 




no 


yes 


Expanded 


25 


Pwx^y^ 


no 


yes 


Expanded 


125 


Pvwx^y^ 


no 


yes 


Expanded 


625 


Puvwx^y^ 


no 


yes 


Expanded 


3125 




no 


yes 



- 32 - 



5,3, Information Theory and Its Underlying Concepts 

The origin of the mathematical theory of communication cannot 
be linked with any specific investigator. Elements of the 
theory can be observed in the work of several researchers 
(Cherry 1968, 41-52). The work of Hartley, Gabor, Wiener, 
Shannon, Weaver and others led to a relatively integrated 
system of concepts whose practical utility has been demon- 
strated in several fields (cf. e.g. Miller 1963). Information 
theory is a formal " as opposed to substantive - theory. It 
is not a model*' of behaviour, even communicative behaviour, but 
rather a tool that may be used in the construction of such 
models. Information theory is much more general than the ter- 
minology associated with it might suggest. Measures based on 
probability concept are applicable to many situations where 
terms like ''information transmission" are awkward or quite 
inappropriate (Frick 1959, 183-184). 

A precursor of information theory was probability theory, 

which started from intellectual schematizat ion of what happens 

in games of chance, and which has growr\ into one of the fun- 

* 

damental tools of exact science (Rapoport 1956, 141). 
Information theory suffers thus from all the conceptual 
difficulties associated with probability theory (Frick 1959, 
183). Information theory is f undamental ly a theory of selec- 
tion. Something is selected from a set of predefined alterna- 
tives. To examine the selective process it is essential to be 
able to examine the set. Scrutiny. of the characteristics of a 
Markov chain is based on transition probabilities between 
states in the set. With the indices of information theory it 
is attempted to describe the chain formation tendency of class- 
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room communication classified by the 13 categories FIAC 
system. These indices have very clear connections with the 
statistical measures regularly used (cf. e.g. Hays 1963, 
610-612; Ackoff 1956, 218; Miller 1963, 125). Information and 
redundancy are central concepts in these indices. 

InformatioQ . For Shannon, the amount of information contained 
in a message is the amount of freedom of choice involved in 
the selection of the message. A unit of choice is defined as 
the selection of one out of two equally available symbols. - 
Thus, in selecting one of two equiprobable symbols, one choice 
unit is involved and the resulting one-symbol message contains 
one unit of information (Ackoff 1956, 214.). The term "infor- 
mation" occurs in the theory of communication in a particular 
way. It is not ;synonymous with "meaning". Only the amount 
of information is measured - the amount does not specify the 
content, value, truthfulness, exclusiveness, history or pur- 
pose of the information (Miller 1963, 123). Cherry criticises 
the concept of information in the context of the mathematical 
theory of information as follows: 

"The measure for H (the amount of information) from 
Wiener and Shannon is applicable to the signs themselves, 
and does not concern their 'meaning'. In a sense it is a 
pity that the mathematical concepts stemming from Hartley 
have been called 'information' at all. The formula for H 
is really a measure of one facet only of the concept of 
information; it is the statistical rarity or 'surprise 
value' of a source of message-signs" (Cherry 1966, 51). 

Bit. The amount of information depends upon the fraction of 
the alternatives that are eliminated by the message. Every 
time the number of alternatives is reduced to half, one unit 
of information is gained, this unit is called one 'bit' of 
information. The amount of information in a message that 
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reduces k to k/x is log^x bits. To give a classic example, 
a child needs four bits of information in order to select 
with certainty from sixteen equally probable boxes the one 
which contains a sweet (log2l6=4 or -log2(1/16)=4 , if for 
each box p=1/16). Such a situation of equal probabilities 
represents maximal uncertainty (H^^^), 

Average amount of information (or uncertainty ). Since we 
wish to deal with sources rather than particular messages, 
we need a measure to represent how much information a source 
generates. If different messages contain different amounts 
of information, then it is reasonable to speak of the average 
amount of information per message we can expect to get from 
the source. This expected value is denoted; 



Its interpretation is the average minimum number of yes-no 
answers required to describe the selection of the signals 
from the predefined set (Cherry 1966, 176-162; Miller 1963, 
124). 

Redundancy . Evervts tnay have sequential connections in the 
sense that knowledge of a preceding result reduces the un- 
certainty of predicting a following result. The chain then 
contains a redundancy whose amount is expressible with the 
indices of information theory. Quantitatively speaking, the 
redundancy of a source is assessable only in relation to the 
known set of probabilities. Thus we can quote the redundancy 
of a source on a monogram or digram basis etc. The redundancy 
of a source may be quoted: 



k 



(5) 





i = 1 
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(6) R = ^ 

max 



where H = average amount of information 
of 1 , 2 etc . orders of 
approximat ion 

and H = average amount of information 
of zero order of approximation 



This index receives the value 1 when the chain contains no 
information (there is no freedom of choice, or the probabi- 
lity of selecting any one of a set of signs is l.o) or the 
information is fully predictable. If the chain's degree of 
organization is completely random, the index receives the 
value 0. If the course of action is coherent in the sense 
that future conduct depends upon past conduct, we say that 
the behaviour is predictable or, to some degree, stereotyped. 
In such cases the redundancy can be used to measure the degree 
of organization in the behaviour sequences (Frick 1959, 185; 
Miller 1963, 128). There is a simple yet very important 
theorem concerning statxb:ical constraints and redundancy, 
expressed by Cherry as follows: 

"If all the various transition probabilities p.(j) are 
equal, then the individual signs become statistically 
independent and equally probable. In such a case there 
are absolutely no preferred guesses as to what signs 
will be given out by the source; redundancy is provided 
by the existence of unequal transition probabilities... 
But notice that the converse does not hold: it is easy 
to arrange that all signs should be equiprobable, yet 
have unequal transition probabilities" (Cherry 1966, 
184). ^ 

4 

Order of approximation . A zero order approximation to the 
stochastic structure assumes that all events (signs) are equi- 
probable and independent of each other (=H ). A first order 
^ ^ max 

approximation is obtained by assuming that successive signs 
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are independent but have unequal probabilities (measured on 
an adequate sample of classroom interaction),* In second or- 
der approximation the conditional probabilities of the digram 
structure P^^Y^ 9^*6 known. In higher order approximations we 
further specify the conditional probabilities p^^(y) etc. 



Average amount of information in different orders of 
approximation . 



Order of approximation 
0 

1 

2 

3 

etc. 



Formula 

H = -log^d/k) 
max ^2 

H(x) = -£'Pi(log2Pi^ 

H^(y) = p. jClog^p. (j)) 

Similarly the information 
rate (or uncertainty) may 
be calculated for chains 
having trigram structure; 
etc. 



Relative reduction in uncertainty . The relat i ve reducti on in 
the average amount of information which Hays suggested (1963, 
612) has also been calculated. In this the object of compari- 
son is not the maximal state of uncertainty (H ), but the 

^ max 

preceding H value from its approximation. Then, for instance, 
the relative reduction in uncertainty in y given x is: 

H(y) - H (y) 

(7) R (y) = ^ 

H(y) 

When examining the chains we must stop at the point, compatible 
with the amount of data available, where uncertainty (average 
amount of information) is no longer being significantly 
reduced. Unfortunately, there are no statistical significance 
tests for the verification of this change. We shall return 
later to the fundamental phrase "compatible with the amount of 
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data available". 

5.4. First Passage Time Matrix [FPT) of the Original 
process with 13 States 

When we examine average step figures in the FPT matrix we 
observe clear differences between the columns, whereas the 
within-column variance is small. The between -co lumns variance 
is due to the relative frequencies of the states concerned. 
The more repetitive a state is, the fewer the number of steps 
by which it is reached: the state from which a start is made 
hardly affects the matter. As to the wi ' hin-columns variance, 
attention is drawn to the natural (and trivial) occurrence 
that the average number of steps from state x to state y 
decreases as the corresponding P^(y) increases. Correlation 
coefficients calculated between columns (FPT matrix and P^(y) 
matrix) vary between -.80 and -1.00, the median being -.95. 
From the dispersions corresponding to average values of the 
FPT matrix (Table 15) it is seen that when moving to the same 
state (1-1, 2-2 etc.) average steps vary less than when moving 
to other states. Dispersions also have a strong connection 
with the corresponding relative frequencies by categories. It 
is apparent from the calculated matrices of the Markov chain 
that they give no significant information on the structure of 
classroom communication. From the FIAC matrix they give 
trivial and redundant inforraat in'*, which is easy to read from 
p(y) and P^'V'^ values or even straight from transition fre- 
quency matrix. These descriptive quantities have been used 
with great success, however, to illustrate such matters as 
social rise (or decline) between generations, or to predict 
interprovincial migration in a certain district (Nowak 1964; 
Bartholomew 1967; Rikkinen 1971). The usef u InBSi- of the FPT 
matrix and the SD of FPT matrix for analysis of typical 
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classroom communication with instruments similar to FIAC is 
greatly reduced by the fact that transitions in the observa- 
tion chain are not separate, independent of each other, but 
the y of a preceding pair is always the x of a following pair. 
This produces an internal dependence which is clearly harm- 
ful to the expressiveness of the descriptive quatities of 
the Markov chain. 

Table 12. Transition Frequency Matrix 
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Table 13. Transition Probability 
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Table 14. First Passage Times Matrix (FPT) 
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Table 15. Standard Deviations of the First Passage Times 
(SO of FPT) 
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If dispersions in the SD of FPT matrix are examined, the 
ratio of figures to corresonding averages in the FPT matrix 
clearly shows that the distributions of the first passage 
times are positively skewed. This leads - intuitively, it 
is true - to two alternatives. Either (1) the information on 
behaviour obtained in natural situations is highly unstable 
(or the sampling error is considerable), or (2) parts of the 
chain can not be regarded as arising from the same population 
The former hypthesis is supported by Cattell's observation 
(1966b, 393-396) that situation variables generally have a 
far greater variance than test variables. Similarly, Winne- 
feld (1957, 30-32) emphasizes the openness, instability and 
tendency to disturbance of the pedagogic field. But what view 
would such a hypothesis disclose to the researcher? There- 
after one could study just as well the chain generated by the 
source of random numbers. Far more convincing is the notion 
that, above the microlevel of instruction, higher level 
constraints or frame factors can be found which provide us 
with a source of explanation for the disparity of chain compo 
nents. Guiding factors of this kind have been found, for 
instance, first, in the composition of a group of pupils and 
the theme dealt with (Lundgren 1972); second, in the process 
of clarifying objectives (Flanders 1970, Ch. 10) and in the 
cognitive treatment level of the subject matter (Taba et al. 
1964). Cattell also notes that it is useless to examine 
endogenous processes alone, because processes are environ- 
mental tied in most cases. 

"Thus nest-building in spring can be defined only in 
term both of the behaviour of the bird and of the 
unfolding of spring" Cattell 1966b, 394). 

Jackson (1962) and Koskenniemi (1972a) are among those who 
strongly emphasize external factors of a directly observed 
situation, especially the preparation and planning process 
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which precedes instruction. In this respect the purpose of 
constructing theories of instruction is to integrate the 
existing store of information , which is already considerable. 
Mere enumeration and grouping of the factors affecting the 
teaching-learning situation are not sufficent: theoretizat ion 
must also be capable of specifying the intervening mechanisms 
which enable us to explain and understand the connections 
between frame factors and the micro-level of instruction. 
It is clear that instruction ds in practice a more spontaneous 
less rational and therefore more unstable, process than the 
so-called preactive phase of instruction. If it is thought 
indispensable to remain in natural situations with greater 
external validity, the only possibility of reducing the part 
played by random error is greatly to increase the number of 
units studied. 

5.5. Information Rate (average amount of information or 

uncertainty) and Redundancy Measures of the Original 
Process 

As with the FIAC instrument, variables have been formed in 
different ways from the observation devices used. In small- 
group studies (e.g. Bales 1950; Mishler & Waxier 1968) 
variables have generally been formed from category profiles 
(i.e. first order approximation). In analysis of classroom 
communication, not only the category profile bu also a number 
of sum and ratio variables are almost always calculated: to 
form these the number of transitions must be known (second 
order of approximation). A factor of interest is the extent 
to which uncertainty decreases from one level of approximation 
to the following. Redundancy can be used as an index of the 
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UBtiec to which the later phases in a chain are predictable 
t r-c-m Its earlier ones. In addition to the redundancy column 
I iv^ 'rslative reduction in redundanc-y' column should be 
cLuUxfjd in the tables presented. Because the ratio of chain 
jcjr.gtn to number of categories (k) decreases continuously 
ds we advance to expanded processes of higher approximation 
orders, and because redundancy grows as a result of this 
trivial circumstance, a clear turning point in the redundancy 
curve is often recognizable in the relative reduction column. 
Later redundancy again grows strongly when the limit of 
statistical uniqueness is reached: there is only onechain 
of the length of that examined, so that all transition 
probabilities are 1.0. 

Figure 9. Connection of Redundancy with Degree of 

Approximation in Study of Finite Material 
(N=number of elements or steps in the chain) 

/.a 

redundancy 



0 i z s • • • ^-J ^-t ^ 

order of approximation 

For comparison the same indices were calculated for two other 
sets of material which were taped with the same class. Holo- 
painen (1971; 1972) analyzed with Bales' IPA 13 group work 
lessons videotaped during the academic year 1968~69. Indices 
illustrating the group work process were calculated on the 
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basis of Holopainen's results (1971, 117), converting them to 

log^ system and expressing them in bit units. With the same 

class in 1967-69 96 lessons of regular class instruction were 

videotaped. The Lessons were coded with Bales' IPA. All the 

25 lessons of this work coded with the FIAC system were 

included in these 96 lessons. The sets of material mentioned 

make some kind of a frame of reference for comparisons of 

predictability. It should be noted, , however, that H in the 

max 

96 lessons material is not log^lS (with 13 category IPA) but 
log226, because teacher and pupils are kept separately as 
actor and target in the interaction matrix. Bales' 13 
category modification by Holopainen (1972, 6) was employed 
in both sets of material. The meanings of the columns in the 
following tables are: 

(1) order of approximation 

(2) number of states (k) in the process 

(3) expected N/k; N= number of successive elements 
or steps in the cain 

(4) average amount of information or uncertainty (bits) 

(5) redundancy 

(6) relative reduction in redundancy 



Table 16. Redundancy in the Group Work Material of 



Holopainen (1971). 
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Table 17, Redundancy in the 96 Class Instruction Lessons 
Material 
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le 18. 


Redundancy of the 
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FIAC Chain 
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The main results expressed in Tables 16 - 18 are as follows. 
Redundancy in the chain illustrating group work does not 
increase after first order of approximation. The sharp rise 
in fourth order of approximation occurs because a limit has 
been reached in the N/k ratio in whose proximity conclusions 
cannot be drawn for such a short chain (8 400 acts vs. 
28 561 different combinations of four states). Any observed 
combination far exceeds the probability of its expected 
chance occurrence. In class instruction lessons the difference 
between first and second 'order of approximation is consider- 
able: in both cases the redundancy rises, for the second 
order to a value of ,41. According to Table 18, the redun- 
dancy no longer grows after this. For Table 17 material, 
unfortunately, no trigram structure was available so that 
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comparison for these parts is lacking. Results show that the 
interaction matrix contains information which should be taken 
into account in determining variables. This observation 
supports the decision, made at an earlier stage, to use 
millage matrix cells as variables. Work directed by a teacher 
is clearly more organized (stereotyped) in its sequences 
than group work. Interaction roles in the former are quite 
distinctly separated, and the impulses of pupils to direct 
the process must be restricted to some extent. When Bales' 
IPA is applied to examination of the task-oriented small 
group, it appears that the interaction profile contains all 
the information obtainable from analysis. It is quite 
conceivable that the age (grade) of pupils and their habit- 
uation to small group work are connected with the arrangement 
of interaction structure. 

In addition to the above, corresponding descriptive quantities 
for state chains beginning with 1, 2, ... 12 and 13 were 
calculated from the original material (FIAC). It should be 
noted that when measures of chains formed in this manner are 
examined, redundancy may fall in the higher orders of 
approximation; this is shown in column 6 as a negative quantity 
of the relative reduction of uncertainty. If the total chain is 
examined, this is seen impossible. With higher approximation 
orders redundancy may remain unchanged or grow, but it cannot 
diminish. How then does such a phenomenon occur when we 
examine such parts of the chain that are specified by the 
initiating state? As an example, let us take state chains which 
begin with 4. 

i-4; P3 P4 P5 Pe P7 Pe Pg P10 P11 P12 P13 

.01 .00 .00 .15 .00 .04 .04 .03 .39 .02 .01 .31 .00 
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Table 19. Redundancy of the Original 13-state Chain 

Specified According to the Initiating State 



1) 


(2) 


(3) 


(4) 


(5) 


(6) 


1 = 1 

J. ~ 1 












0 


13 


67.9 


3. 70 


.00 


.00 


1 

1 


1 3 


67 9 


3 . 08 


1 7 


1 7 


2 


169 


5.2 


2.07 


.44 




1 =2 












0 


13 


17.6 


3. 70 


.00 


.00 


1 

1 


1 3 

1 ^ 




^ . 3 u 


9 1 




2 


169 


1.4 


2.27 


. 39 


.22 


± ~ J 












0 


13 


43.7 


3. 70 


.00 


.00 


1 

1 






■? 1 fl 


4 1 

. H 1 


d 1 

. H 1 


2 


169 


3.4 


1 .91 


.48 


.12 


1 =4 












0 


13 


71.0 


3. 70 


.00 


.00 


1 

1 


1 3 


7 1 n 

/ 1 . u 


^ . ^ u 






2 


169 


5.5 


2.40 


. 35 


-.06 


J. J 












0 


13 


18.1 


3. 70 


.00 


.00 


1 


13 


16.1 


2. 11 


.43 


.43 


2 


169 


1.4 


1 .22 


.67 


.42 


i=6 












0 


13 


275.8 


3. 70 


.00 


. .00 


1 


13 


275.8 


1 .63 


. 56 


. 56 


2 


169 


21.2 


1 .66 


. 55 


-.02 


i = 7 












0 


13 


82.0 


3. 70 


. 00 


.00 


1 


13 


82.0 


2.71 


.27 


.27 


2 


• 169 


6.3 


2.26 


. 39 


.17 



- 47 - 



Table 19. 
(continued) 
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(meanings of columns on page 43) 
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We observe that 4-4 and 4-12 are the transitions which make 
the first order distribution of state chains beginning with 
4 as above illustrated. In fact it would be more correct to 
speak of second approximation order, because i = 4 is a 
condition for the relative frequencies of the following states 
although it is a constant. Intuitively it is clear that such 
a high predictability may decrease if the components of the 
chain are examined separately (specification made according 
to the initiating state). On the other hand, the average 
amount of uncertainty in the whole chain cannot grow in 
comparison with the previous approximation. Such a decrease 
in redundancy occurs particularly in the case of chains be- 
ginning with i in which the state frequency profile after i 
is especially uneven. Some p. may then be approaching unity, 
and the redundancy does likewise. In the distribution of the 
following states, however, no pj^ need necessarily come close 
to 1. 



Activity is most predictable after the starting states 5 
(broad question} and 6 (lecture). The predictability of the 
former may be due to the chain 5-5-12-12, because a broad 
question is often longer than normal and leads to a pause 
for reflection before the pupil speaks. The latter owes its 
predictability directly to the 6-6-6-6 nature of the "gives 
information** category . Pg(6)= .73 reflects this characteris- 
tic. Least predictable is behaviour after 10 (spontaneous 
relevant student talk). Such an input of the pupil - because 
of its content and other factors assessed by the teacher - 
contains many further possibilities. Such spcntaneous stimu- 
lus demans that the teacher decide quickly how to react. 
The situation produced by quick decisions obviously leads to 
great variations in what follows. In activity after 4 (narrow 
question) the small extent of predictability is surprising. 
One would expect drill-type exercises, for instance, to be 
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particularly stereotyped. The fact is, however, that ques- 
tions in accordance with category 4 are highly varied - 
80 % of questions asked belong to this category. The dis- 
criminating power of the FIAC version in use with regard to 
the cognitive level of a question is far too modest. Stereo- 
typed drill obviously covers only a small part of category 4. 

Regarded as a whols, chains examined componenti ally give the 
same picture of the situation as does the whole material. 
Redundancy grows, to be sure, as we move to second order 
approximations, but a vital turning point in the reduction 
of uncertainty is reached within the distribution of cate- 
gories after the specified initial state. 

5,6. Redundancy in the Lumped Process 

The examination of expanded process presupposes that either 
the material hes a notably great number of transitions (e.g. 
100 OOO - 300 000), or a clustering of original states to 
new states has to be done. 



For the present material the latter method, the lumped pro- 
cess, was chosen. Former states were combined to form new 
states as follows: 
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From the new chain formed from combined states it was possihie 
to examine even 5-gram structure. The expected nean frequency 
was still 4.7 per combination, which then number 3,125. 

Table 20. Transition Frequency Matrix of the Lumped Process 
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Table 21. Transition Probability Matrix of the Lumped Process 
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Table 20 shows that the logical clustering of t.h.t, original 
states has been successful, because the relative frequencies 
of the new states in the lumped process are very homogeneous 
(p^ » .18, p^ = .25, p^ « .13, p^ .24 and p^ = .211. The 
monogram distribution produces no notable redundancy, but 
the reduction of uncertainty remains dependent on conditional 
transition probabilities. From the five-state chain obtained 
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expanded chains were formed as far as 5-gram structure. 

It is observed from the digram transition probability matrix 
that the so-called steady-state cells on the diagonal contain 
very high probabi lities. When a substantial proportion of the 
variations are homogenized in the clustering, the stereotyped 
nat ure of class instruction is emphasized more than before. 
Some law of continuity controls instruction directed by a 
teacher. Thus predictability from the dyads is strongly em- 
phasized in comparison with the profile. Predictability has 
been examined in the same manner as in the original 13-state 
process. Here too the chain was dispersed into the chain 
groups following each state, in which the same examinations 
were made. 
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Table 22. Redundancy of the Lumped 5-state Chain in 

Total and Specified According to the Initiatin 
State (j) 
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Table 22. 
(continued) 
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(meanings of columns on page 43) 



Figure 10. Relationship between Redundancy and Order 
of Approximat ion 



redundan 




order of appro)< nation 
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Examination of the lumped process convincingly reveals that 
no important reduction of uncertainty occurs from digram to 
trigram structure in the material. Total uncertainty based 
on recognition of dy^ds is 1.82 bit/code. Recognition of 5- 
gram 'Structure reduces this to 1.54 bit/code, but this is a 
phase in which the amount of material produces a spurious 
decline in the average amount of information. Structures of 
different orders of approximation from the lumped process, 
specified according to the starting state j of the chain 
U ranging from 1 to 5), are examined in Table 22 and, with 
regard to the growth of redundancy, in Figure 10. The re- 
dundancy in the chains beginning from state 2 behaves excep- 
tionally. Negative signs will be observed in the column 
"relative reduction in uncertainty" of the Table (Table 22; 
j = 2). The reason for this has been given in Section 5.5., 
pp. 45-48. Examining the chains which begin from state 2 
(lecture), we notice that the distribution of the following 
states is dominated by a tendency to stay in the steady state 
(p^ = .73), provided, that is, that the mon gram structure 
is studied with 2 as the starting state. Lecturing and giving 
information are, by their nature, activities which require a 
certain temporal continuity. The probability of leaving this 
state grows, however, when the number of successive 2-states 
increases. This is why redundancy in a case like this de- 
creases in higher orders of approximation. 

Regarded as a whole, the general impression received from 
the lumped process is very similar to that given by the 
original 13-state chain. Uncertainty is not substantially re- 
duced if we move to higher approximation orders from the 
digram structure. Predictability of the course of instruction 
is r^lativelu poor, and lasts only for a average of 5-7 
seconds. It should be remembered that when comparing the bit 
figures of column 4 in Tables 16, 17, 18, 19 and 22 the rel- 
ative nature of the concept is to be taken into account. The 
bit figures are naturally smaller in the lumped process, be- 
cause the selection i^s between 5 states instead of 13. 
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6. Oiscussion 

6,1 , On Seq u 6 n t i a 1 An alysis of Behaviour 

Knowledge of the elements of behaviour is obtained by obser- 
vation. The elements must be specified in advance by means of 
a taxonomy. The basis of classification may be either logical 
or empirical, often also a combination of both. In most cases 
taxonomies are deduced from theoretically derived concepts. 
These concepts and constructs are then operationally defined 
through the classification system and those indices which can 
be derived from it. A good example of the connection between 
theory and taxonomy is the lAC system of Flanders. Measurement 
by a taxonomy of this kind is never "objective": events are 
seen in the framework of a particular taxonomic solution (cf. 
Koort 1972). Within different taxonomies the determination of 
variables has been based on frequency of occurrence, duration, 
intensity etc. The fact that behaviour is structured makes it 
easier for us to predict what behavioural elements will occur 
in a specified environment. These elements are not in a random 
sequence. Some occur more frequently in temporal juxtaposition 
with each other than with others. Any sequence of events be- 
tween which conditional probabilities exist may be called a 
stochastic process. The most common stochastic processes are 
Markov chains. In the first place four models have been used 
to deal with statistical dependencies between behavioural 
events . 

2 

The X model . Many ethologists (cf. Hutt & Hutt 1970, 1711 but 
also small-group investigators (e.g. Steinzor 19491 have tab- 
ulated observations of behaviour in the form of a contingency 
table. They employ the technique of calculating the expected 
value for each cell, using the hypothesis of statistical 
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independence. The expected value for each cell is obtained 
from the ratio: 

row total X column total 
grand total 

Any "major discrepancy* between expected and observed value 
will give indications of affinity between the acts. Ap- 
plying the simple X we can define in statistical terms what 
we mean by a "major discrepancy". Thus this type of analysis 
forms a valuable initial step in studying the dependencies 
amongst sequences of behaviour. 

The factor analytic; model . This is the model employed in va- 
rious parts of this study. Wiepkema (1961) was one of the 
first to demonstrate the possibilities of factor analysis in 
this field. Later Coats, Gess, Soar and the present writer 
have all independently applied factor analysis to study of 
the interaction matrix (cf. Flanders 1970, 412-413). The mod- 
el gives a highly promising opportunity to form the variables 
of a lesson. Further study must be directed to the examina- 
tion of the validity of the variables. 

The informational model . The previous mo dels make use of the 
digram structure of behaviour. The indices of mathematical 
communication theory enable us to examine the connections of 
longer state chains. The-model holds a central position in the 
present study. Such exhaustive analyses were made because in 
the works of researchers using FIAC or similar category sys- 
tems there is some indication of models based on transition 
probabilities, Markov chains in particular. It seem probable, 
however, that such models do not open new possibilities for 
the format ion of variables. 
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The phrase structure grammar model . It has been questioned 
whether Markov processes are the most appropriate model for 
the comprehensive description of behaviour. It has been sug- 
gested that models derived from psycholinguistics might be 
particularly worthy of study. For instance. Miller et al. 
(1960) have stressed the role of plans directing the behav- 
iour. Elsewhere Miller (1962) gives a graphic example of the 
determination of syntactic structure. The article by Miller 

(1962) together with a short monograph by Chomsky (1957; cf. 
also Chomsky 1964) provide an adequate introduction to the 
subject. According to available information only Marshall 
(1965) in reanalysis of the material of Fabricius & Jansson 

(1963) has used generative grammar for the formal analysis 

of structured behavioural sequences. A phrase structure gram- 
mar is then composed which is capable both of generating the 
requisite behaviour patterns and of representing the hier- 
archical organization of these patterns. The strength of this 
model lies in its ability to deal with a considerably greater 
behavioural repertoire and much longer sequences than can be 
treated by Markovian processes alone. No applications to class- 
room communication are known. To what extent the determination 
of variables by this means becomes fruitful, is a matter which 
remains to be seen. 

6.2. Some Critical Points of View 

Stationary vs. non-stationary sources . Mathematical communi - 
cation theory can be applied to situations in which organism 
adopts a reasonably stable course of action that can be de- 
scribed statistically. Statistical stationariness means that 
if the observer watches the sequence for a very long time T, 
the relative frequency estimates p(i) of the various signs he 
makes will not depend upon the actual moment of starting, i.e. 
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the statistical properties of a stationary source are invari- 
ant under a shift of the time origin. This assumption of sta- 
tionariness is normally required in statistical communication 
theory, and is one of its present limitations. This is one of 
the main obstacles to the application of the theory to human 
interactive behaviour (Cherry 1966, 179). 

Delayed constraints . There remains one interesting question. 
To what extent does behaviour in the classroom communication 
network depend upon any particular past event independent of 
intervening more recent past events? 

Crossing const raints . The time order of events is not always 
their logical or true succession. During the instruction pro- 
cess an interruption occurs (e.g. a spontaneous, irrelevant 
pupil suggestion), and the interrupted sequence continues per- 
haps after three or four acts. Similarly, and especially dur- 
ing lessons containing periods of individual guidance, a cross- 
ing interaction is noticeable whose time succession does not 
correspond to the logical or intended succession. In a long 
chain this may be considered a random phenomenon whose diver- 
gences will balance out before long. There are some clearly 
recognizable situations, however, in which the occurrence of 
such crossing successions is more frequent than usual. 

Non-Markovian constraints . Chomsky (1957; 1964) has s hown that 
certain types of constraints that occur in human communication 
cannot be generated by any left-to-right finite state Markov 
chain. The existence of nested grammatical structures in lan- 
guage implies that the individual's present behaviour is ba- 
sed upon his plan for future behaviour and on his prediction 
of the behaviour of the other members of the social group 
(altmann (1965, 519). Dbservation of redundancy does not tell 
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us what factors cause statistical dependence in the chain. The 
goal is certainly one of the explanatory frame factors which 
steer the process of interaction. In its further investigations 
the Helsinki IPA research group has paid vigorous attention 
to the goal-oriented nature of instructional situations. 
By way of preliminary experiments during the autumn term of 
1972 a series of instructional periods was held in the school 
class of the Institute of Education, University of Helsinki; 
the first phase of this consisted of joint planning within a 
framework defined by the teacher. Planning is directed both 
to aims and modes of work (Koskenniemi (1972a). 

Generali ty vs. uniqueness . A comment by Cherry will serve as 
introduction to the problem: 

"The gathering of monogram and digram statistical data 
involves an immense amount of labour, and with trigram, 
quadrigram and so forth this becomes increasingly 
prohibitive; lest the reader should feel that we are 
merelyquibbling and avoiding a point of principle, 
that given time, patience, unlimited cash and computing 
machines we could collect 10-gram or 100-gram word 
transition probabilities of Shakespeare's writings, 
the following point should be stressed. The limitation 
IS not mere labour of letter or word counting: it is 
the fact that there are not enough books . The data 
gathered would not be statistical; they would be 
Shakespeare's actual lines and verses , for each seque nce 
would occ ur only once " (Cherry 1966, 40; underlining bv — 
Jthe present author). 

The problem is how to examine the classroom communication in 

our obviously similar present-day conditions. Coded with a 

13-category instrument for tally/3 seconds, chain variations 

of one minute duration may occur to the number of about 
22 i-i" 

1.9 X 10 . Here too, however, the purpose of measurement is 
to produce variable values which will indicate essential fea- 
tures of the instructional situation, not to reproduce the 
original sequence of behavioural events in numerical series. 
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Variables which are too general have not proved useful, as 
Landsheere (1971) indicates in his criticism of research on 
instruction. In my view the factor-analytical approach offers 
a fruitful starting point for the formation of variables from 
sequential behaviour: superfluous uniqueness is removed, and, 
on the other hand, over-simplifying generalization is avoided 
of the dimensions of classroom communication. 

A note on research strategy . In the last few years there has 
been a spirited debate on the strategy followed in the research 
on teaching. This was started by Stolurow (1965), and it has 
led to discussions on active vs. passive or improving vs.- de- 
scriptive approach. Both trends have their supporters. Ebel 
(1969) argues that we should gather data which describe the 
present state of education and then let experience, reson, 
dialogue and consensus - rather than controlled experimenta- 
tion - guide us to educational improvement. As Gage and Unruh 
put it: 

"The long-range assumption of describers is that once 
important correlates of teacher effectiveness in the 
present-day classroom have been ascertained, it will 
be possible to train teachers to be more effective. 
They take the conver."*" ional classroom as given, and 
seek to improve the behaviour of the teachers in it" 
(Gage and Unruh 1969, 4). 

•Jackson (1966), Bellack ( 1966), Smith and Meux (1970) etc. 

also start with a "passive" approach. The improvers, taking 

an "active" approach , attempt rather to master the teaching 

model, i.e. to develop a new model of instruction that will 

make explicit the manipulable elements and relationships 

needed to optimize learning. Experimentation will correct 

and refine the model (Gage & Unruh 1969, 5). The ideas of 

active approach supporters are compressed by Travers as 

follows: 
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"Useful knowledge about the conditions that are related 
to learning is not likely to be derived from the obser- 
vation of naturally occurring teaching situations. A much 
more profitable approach is to construct genuine labor- 
atory experiments involving class room -like situations, 
but with a much higher degree of control over experimen- 
tal conditions than was found in some of the older forms 
of educational experiment that compared method A with 
"method B. Once the study of teaching has developed tech- 
niques that approximate those of an experimental science, 
the difficulties at present associated with the wide- 
spread usage of ill-defined terms ceases to exist" 
(Travers 1971 , 39 ). 

The tendency to move from correlative to experimental designs 
is seen with other researchers who have a more moderate atti- 
tude to the subject (e.g. Flanders 1970; Nuthall 1970 and 
1972). It should be borne in mind, however, that the ap- 
proaches presented here are not opposed to each other be- 
cause they produce different kinds of information. There 
should be no orthodoxy in this matter. Both should have a 
place in the field of research -on teaching. 



6.3. Continuation of Study 



I mention here two directions in which I intend to continue 
research into the instructional process. The necessary material 
has been collected, but analysis and reporting are not yet 
completed. For reasons of space the following introduction 
is mere ly schemat i c . 



PJ^an^A. At the Institute of Education, University of Helsinki, 
the classroom behaviour of the same set of pupils was fol- 
lowed up for two academic years (1969-71) during class instruc- 
tion lessons in different subjects. Analyses were made by a 
modified 23-category subscripted FIAC system, which can, if 
required, be reduced to the regular ten categories (Flanders 




- 62 - 



1966; 1970, 126-152). Independently of the above, the same 
lessons were analyzed by Bales* IPA (1950) and a slightly 
modified version of the coding system developed by Bellack 
and his associates (Bellack et al. 1966; Karma 1972). The 
aims are: 

(1) To recapitulate with more extensive material 

the factor- analytical and information -theoretical 
examinations which have already been performed . 

(2) To perforrri an examination between taxonomies in 
order to formulate more precisely the validity 

of the sequence factors within the context offered 
by other taxonomi es . 

The data matrix to be analyzed is the following: 

FIAC Bales* IPA Bellack 

variables variables variables 



1 
2 

75 



Plan B . In autumn 1971 a new form of entrance examination was 
set up for the selection of elementary school teacher candi- 
dates. In this connection a research team led by Associate 
Professor Alikoski gathered a considerable body of information 
on the applicants. Most of this had no effect on the composite 
entrance requirement score on which selection was based. As a 
new instrument of selection a so-called teaching episode was 
used for the first time, replacing the "instruction test" 
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used in earlier years (cf. Koskenniemi et al. 1965, 553-556). 
The teaching episode was about 10 minutes in length, the 
pupils were from the fourth grade, and the theme of instruc- 
tion with necessary material was given to the candidate an 
hour in advance. Performance was rated in three dimensions by 
three independent judges. The teaching episode had consider- 
able influence on the variance of selection scores. The pre- 
sent writer arranged stereo voice recordings of all the teach- 
ing episodes performed by applicants for the Helsinki Teacher 
Training College. These were analyzed by the 23-category sub- 
scripted FIAC system and the Bellack system. One hundred and 
sixty persons took part on the entrance examination, and half 
were allowed to continue their studies in the training insti- 
tute. The aim now is to follow the studies of those admitted 
by obtaining similar information on the final phases of their 
training in the spring term of 1974. This study is correlative, 
chief attention being directed to connections between process 
variables (behaviour measured by teaching episode) and other 
groups of variables. Material has been collected for the first 
phase. Groups of variables for this study are as follows (cf. 
Sandven 1967) : 



(A) 



Personal data, earli er 
teaching experience etc. 



(B) 



Abi lity, attitude and 
person a li ty variables 




Possible follow-up in final 
phase of training 



Teaching episode: FIAC, 
Bellack 



Candidate's own perception 
of teaching episode 



Committee *s appraisal of 
teaching episode 
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In the further study which is here introduced, and whose 
details cannot be elaborated for reasons of space, use wi 
be made of the data on formation of variables which were 
obtained in the present study. 
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Appendix 2. 

Stru ctural Deviations for Corresponding Variables betwee 
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Appendix 2. 
(continued) 
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